Covariate Powered Cross-Weighted Multiple Testing

Abstract

A fundamental task in the analysis of datasets with many variables is screening for associations. This can be cast as a multiple testing task, where the objective is achieving high detection power while controlling type I error. We consider $m$ hypothesis tests represented by pairs $((P_i, X_i))_{1\leq i \leq m}$ of p-values $P_i$ and covariates $X_i$, such that $P_i \perp X_i$ if $H_i$ is null. Here, we show how to use information potentially available in the covariates about heterogeneities among hypotheses to increase power compared to conventional procedures that only use the $P_i$. To this end, we upgrade existing weighted multiple testing procedures through the Independent Hypothesis Weighting (IHW) framework to use data-driven weights that are calculated as a function of the covariates. Finite sample guarantees, e.g., false discovery rate (FDR) control, are derived from cross-weighting, a data-splitting approach that enables learning the weight-covariate function without overfitting as long as the hypotheses can be partitioned into independent folds, with arbitrary within-fold dependence. IHW has increased power compared to methods that do not use covariate information. A key implication of IHW is that hypothesis rejection in common multiple testing setups should not proceed according to the ranking of the p-values, but by an alternative ranking implied by the covariate-weighted p-values.
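The cross-weighting idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' IHW implementation: the weight-learning rule below (weights proportional to the fraction of small p-values in each covariate bin) is a simple stand-in heuristic, and the function names `weighted_bh` and `cross_weights`, the quantile binning, and the parameters `n_folds`, `n_bins`, and `tau` are all assumptions made for illustration. What the sketch does preserve is the data-splitting structure: the weights for a fold are learned only from p-values in the other folds, and the weighted Benjamini-Hochberg step then ranks hypotheses by $P_i / w_i$ rather than by $P_i$.

```python
import numpy as np

def weighted_bh(pvals, weights, alpha=0.1):
    """Weighted Benjamini-Hochberg: apply the BH step-up rule to p_i / w_i.

    Assumes the weights are non-negative and average to 1.
    """
    m = len(pvals)
    # A hypothesis with zero weight can never be rejected.
    q = np.where(weights > 0, pvals / np.maximum(weights, 1e-300), np.inf)
    order = np.argsort(q)
    passes = q[order] <= alpha * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if passes.any():
        k = np.nonzero(passes)[0].max()  # largest rank passing the step-up rule
        reject[order[:k + 1]] = True
    return reject

def cross_weights(pvals, covariate, n_folds=5, n_bins=4, tau=0.1, seed=0):
    """Learn covariate-dependent weights for each fold from the other folds.

    Heuristic stand-in (an assumption, not the IHW optimisation): the weight
    of a covariate bin is proportional to the fraction of out-of-fold
    p-values in that bin that fall below tau.
    """
    rng = np.random.default_rng(seed)
    m = len(pvals)
    folds = rng.permutation(m) % n_folds  # balanced random folds
    # Stratify hypotheses into quantile bins of the covariate.
    edges = np.quantile(covariate, np.linspace(0, 1, n_bins + 1)[1:-1])
    bins = np.searchsorted(edges, covariate, side="right")
    weights = np.ones(m)
    for f in range(n_folds):
        held_out = folds == f
        train = ~held_out
        score = np.zeros(n_bins)
        for b in range(n_bins):
            in_bin = train & (bins == b)
            if in_bin.any():
                score[b] = np.mean(pvals[in_bin] <= tau)
        w = score[bins[held_out]]
        if w.sum() > 0:
            # Normalise so the weights average to 1 within the held-out fold.
            weights[held_out] = w * held_out.sum() / w.sum()
    return weights

# Usage on simulated data: signal is concentrated at large covariate values.
rng = np.random.default_rng(1)
x = rng.uniform(size=2000)
p = rng.uniform(size=2000)
p[x > 0.75] = p[x > 0.75] ** 6  # smaller p-values where x is large
w = cross_weights(p, x)
rejected = weighted_bh(p, w, alpha=0.1)
print(rejected.sum(), "rejections with cross-learned weights")
```

Because each hypothesis's weight is computed without looking at the p-values in its own fold, the learned weights do not overfit to those p-values; the formal guarantee in the abstract additionally requires the folds to be independent of one another.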

Related articles

Importance-Weighted Cross-Validation for Covariate Shift

A common assumption in supervised learning is that the input points in the training set follow the same probability distribution that the input points used for testing follow. However, this assumption is not satisfied, for example, when the outside of training region is inter/extrapolated. The situation where the training input points and test input points follow different distributions is call...

Covariate Shift Adaptation by Importance Weighted Cross Validation

A common assumption in supervised learning is that the input points in the training set follow the same probability distribution as the input points that will be given in the future test phase. However, this assumption is not satisfied, for example, when the outside of the training region is extrapolated. The situation where the training input points and test input points follow different distr...

Weighted multiple testing correction for correlated tests.

Virtually all clinical trials collect multiple endpoints that are usually correlated. Many methods have been proposed to control the family-wise type I error rate (FWER), but these methods often disregard the correlation among the endpoints, such as the commonly used Bonferroni correction, Holm procedure, Wiens' Bonferroni fixed-sequence (BFS) procedure and its extension, and the alpha-exhausti...

Weighted False Discovery Rate Control in Large-Scale Multiple Testing

The use of weights provides an effective strategy to incorporate prior domain knowledge in large-scale inference. This paper studies weighted multiple testing in a decision-theoretic framework. We develop oracle and data-driven procedures that aim to maximize the expected number of true positives subject to a constraint on the weighted false discovery rate. The asymptotic validity and optimality...

Partial Knowledge in Multiple-Choice Testing

The intent of this study was to discover the nature of (partial) knowledge as estimated by the multiple-choice (MC) test method. An MC test of vocabulary, including 20 items, was given to 10 participants. Each examinee was required to think aloud while focusing on each item before and while making a response. After each test taker was done with each item, s/he was ...

Journal

Journal title: Journal of The Royal Statistical Society Series B-statistical Methodology

Year: 2021

ISSN: 1467-9868, 1369-7412

DOI: https://doi.org/10.1111/rssb.12411